Ab initio protein structure prediction on a genomic scale: application to the Mycoplasma genitalium genome.

نویسندگان

  • Daisuke Kihara
  • Yang Zhang
  • Hui Lu
  • Andrzej Kolinski
  • Jeffrey Skolnick
چکیده

An ab initio protein structure prediction procedure, TOUCHSTONE, was applied to all 85 small proteins of the Mycoplasma genitalium genome. TOUCHSTONE is based on a Monte Carlo refinement of a lattice model of proteins, which uses threading-based tertiary restraints. Such restraints are derived by extracting consensus contacts and local secondary structure from at least weakly scoring structures that, in some cases, can lack any global similarity to the sequence of interest. Selection of the native fold was done by using the convergence of the simulation from two different conformational search schemes and the lowest energy structure by a knowledge-based atomic-detailed potential. Among the 85 proteins, for 34 proteins with significant threading hits, the template structures were reasonably well reproduced. Of the remaining 51 proteins, 29 proteins converged to five or fewer clusters. In the test set, 84.8% of the proteins that converged to five or fewer clusters had a correct fold among the clusters. If this statistic is simply applied, 24 proteins (84.8% of the 29 proteins) may have correct folds. Thus, the topology of a total of 58 proteins probably has been correctly predicted. Based on these results, ab initio protein structure prediction is becoming a practical approach.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Chapter 11: Genome-wide protein structure prediction

The post-genomic era has witnessed an explosion of protein sequences in the public databases; but this has not been complemented by the availability of genome-wide structure and function information, due to the technical difficulties and labor expenses incurred by existing experimental techniques. The rapid advancements in computer-based protein structure prediction methods have enabled automat...

متن کامل

Gene structure identification with MyGV using cDNA evidence and protein homologs to improve ab initio predictions

UNLABELLED MyGV is an application to visualize (potentially genome-scale) gene structure annotation and prediction. The output of any external gene prediction program can be easily converted to a generalized format for input into MyGV. The application displays all input simultaneously in graphical representation, with a toggle option for a text-based view. Zooming capabilities allow detailed co...

متن کامل

A Genome-Scale Metabolic Reconstruction of Mycoplasma genitalium, iPS189

With a genome size of approximately 580 kb and approximately 480 protein coding regions, Mycoplasma genitalium is one of the smallest known self-replicating organisms and, additionally, has extremely fastidious nutrient requirements. The reduced genomic content of M. genitalium has led researchers to suggest that the molecular assembly contained in this organism may be a close approximation to ...

متن کامل

A study on the frequency of vaginal species of Mycoplasma genitalium, Gardnerella vaginalis and Neisseria gonorrhoeae among pregnant women by PCR technique

Bacterial vaginosis or non-specific vaginitis describes the disease caused by a change in the normal Flora of the vagina, which leads to the elimination of Lactobacilli, generating hydrogen peroxide and excess growth of bacteria, particularly anaerobic bacteria. This disease is the most prevalent infection of the female genital tract, and the rate of frequency of anaerobic bacteria, specificall...

متن کامل

Genomic Analysis of the Genes Encoding Ribosomal Proteins in Eight Eubacterial Species and Saccharomyces cerevisiae.

The complete genomic nucleotide sequence data of more than 10 unicellular organisms have become available. During the past years, we have been focusing our attention to the analysis of the structure and function of the ribosome and its protein components. By making use of the genomic sequence data, our work can now be extended to comparative analysis of the ribosomal components at the genomic l...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Proceedings of the National Academy of Sciences of the United States of America

دوره 99 9  شماره 

صفحات  -

تاریخ انتشار 2002